refactor code to make the API harder to misuse #261

MarcoPolo · 2025-01-11T20:08:26Z

This PR refactors the codebase to make nil pointer dereference panics much harder to hit.

This takes a different approach to how we've treated multiaddrs in the past. Instead of attempting to make them a general and performant data structure, we treat them as just an encoding scheme. Users of multiaddrs are expected to parse the multiaddr into a struct suited to their use case, and to use the multiaddr form when interoperating. Treating Multiaddrs as just an encoding scheme allows a number of simplifications in the codebase. Specifically, this PR does the following:

Removes the Multiaddr interface.
Multiaddr is now a concrete type of []Component.
Component is now a public type, and users can use Components directly.
Components no longer implement the Multiaddr interface, as there is none.
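
As a rough illustration of "parse the multiaddr into a struct suited to the use case", the sketch below models Multiaddr as a plain slice of components and converts it into an application-level type. Component, Multiaddr, TCPEndpoint, and ParseTCPEndpoint are simplified, hypothetical stand-ins written for this example; they are not the library's actual definitions.

```go
package main

import (
	"errors"
	"fmt"
	"strconv"
)

// Component is a simplified stand-in for one multiaddr component.
// The field names are assumptions made for this sketch.
type Component struct {
	Protocol string // e.g. "ip4", "tcp"
	Value    string // e.g. "127.0.0.1", "4001"
}

// Multiaddr is a concrete slice type, mirroring the "[]Component" idea.
type Multiaddr []Component

// TCPEndpoint is a hypothetical application-level struct.
type TCPEndpoint struct {
	IP   string
	Port int
}

// ParseTCPEndpoint converts an /ip4/<addr>/tcp/<port> multiaddr into a
// TCPEndpoint and rejects any other shape.
func ParseTCPEndpoint(m Multiaddr) (TCPEndpoint, error) {
	if len(m) != 2 || m[0].Protocol != "ip4" || m[1].Protocol != "tcp" {
		return TCPEndpoint{}, errors.New("expected /ip4/<addr>/tcp/<port>")
	}
	port, err := strconv.Atoi(m[1].Value)
	if err != nil {
		return TCPEndpoint{}, fmt.Errorf("invalid tcp port %q: %w", m[1].Value, err)
	}
	return TCPEndpoint{IP: m[0].Value, Port: port}, nil
}

func main() {
	m := Multiaddr{
		{Protocol: "ip4", Value: "127.0.0.1"},
		{Protocol: "tcp", Value: "4001"},
	}
	ep, err := ParseTCPEndpoint(m)
	fmt.Println(ep, err)
}
```

The application works with TCPEndpoint internally and only goes back to the multiaddr form at the boundary where it talks to other systems.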

Background

This library has had multiple issues related to Multiaddr being an interface. Many methods use and return nil as the zero value, which behaves poorly when the user forgets to nil-check a returned value and then calls a method on the nil pointer. For example, using Split to split a Multiaddr and then Join to rebuild the original historically would panic if one side of the split was nil. Using an interface also leads to incorrect uses of == to check whether two Multiaddrs are equal (this only works for pointer equality) and to incorrectly using a Multiaddr as a map key.
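
The pitfalls described above can be reproduced with a small, self-contained sketch. Addr, addr, NewAddr, and callOnNil are hypothetical names that only imitate an interface backed by a pointer type; they are not the library's old code.

```go
package main

import "fmt"

// Addr imitates an interface-based address type (hypothetical).
type Addr interface {
	String() string
}

// addr is the pointer-backed implementation behind the interface.
type addr struct{ s string }

func (a *addr) String() string { return a.s }

// NewAddr returns the implementation wrapped in the interface.
func NewAddr(s string) Addr { return &addr{s: s} }

// callOnNil calls a method on a nil interface value and reports
// whether the call panicked.
func callOnNil() (panicked bool) {
	defer func() {
		if recover() != nil {
			panicked = true
		}
	}()
	var a Addr // nil interface: there is no dynamic type to dispatch to
	_ = a.String()
	return false
}

func main() {
	a, b := NewAddr("/ip4/127.0.0.1"), NewAddr("/ip4/127.0.0.1")

	// == on interfaces holding pointers compares the pointers, not the contents.
	fmt.Println("a == b:", a == b)
	fmt.Println("same string:", a.String() == b.String())

	// Map lookups use the same pointer identity, so b does not find a's entry.
	seen := map[Addr]bool{a: true}
	fmt.Println("seen[b]:", seen[b])

	fmt.Println("nil call panicked:", callOnNil())
}
```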

Using an interface is typically done to provide a consistent API surface for multiple implementing types. In practice, however, the Multiaddr interface was only implemented for multiaddr and component (with arguably some awkwardness when using a component as a Multiaddr).

The better approach is to use a concrete type for a Multiaddr. This lets pointer receiver methods work even if the pointer is nil, since the compiler already knows which function to call. Most methods now take a value rather than a pointer which avoids the issue of a nil pointer dereference completely.
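
A minimal sketch of why a concrete type is harder to misuse, assuming simplified stand-in types (Component, Multiaddr, and Join here are illustrative and not the library's real definitions): value-receiver methods on a nil slice just see an empty slice, and a pointer-receiver method can be called on a nil pointer because the call is resolved at compile time.

```go
package main

import (
	"fmt"
	"strings"
)

// Component is a simplified stand-in for one multiaddr component.
type Component struct {
	Protocol string
	Value    string
}

// Empty has a pointer receiver. Calling it on a nil *Component is fine:
// the compiler knows which function to call, and the body checks for nil.
func (c *Component) Empty() bool { return c == nil || c.Protocol == "" }

// Multiaddr is a concrete slice type; its zero value is nil.
type Multiaddr []Component

// String has a value receiver, so a nil Multiaddr behaves as an empty slice.
func (m Multiaddr) String() string {
	var sb strings.Builder
	for _, c := range m {
		sb.WriteString("/" + c.Protocol + "/" + c.Value)
	}
	return sb.String()
}

// Join concatenates multiaddrs; nil inputs contribute nothing.
func Join(ms ...Multiaddr) Multiaddr {
	var out Multiaddr
	for _, m := range ms {
		out = append(out, m...)
	}
	return out
}

func main() {
	var none Multiaddr     // nil slice
	var missing *Component // nil pointer
	m := Multiaddr{{Protocol: "ip4", Value: "127.0.0.1"}}

	fmt.Println(none.String() == "", missing.Empty(), Join(none, m).String())
}
```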

Migration

Refer to ./v015-MIGRATION.md for breaking changes and migration tips.

multiaddr.go

sukunrt · 2025-01-27T16:34:56Z

util.go

@@ -70,135 +62,61 @@ func StringCast(s string) Multiaddr {
 }

 // SplitFirst returns the first component and the rest of the multiaddr.
-func SplitFirst(m Multiaddr) (*Component, Multiaddr) {


Maybe it's better to return (Multiaddr, Multiaddr).
The first Multiaddr would be []Component{first}.

With the current API, most users will need to do component.AsMultiaddr here.

This does break users who were expecting a component, but I believe that's the less popular use case.

MarcoPolo · 2025-01-29

I made this change and backed it out. I think a lot of usages were meant to get the component out, not a Multiaddr.

For example this is a pretty common pattern:

    transport, p2ppart := ma.SplitLast(m)
    if p2ppart == nil || p2ppart.Protocol().Code != ma.P_P2P {
        return m, ""
    }

If we return a Multiaddr type, you can't use the Protocol() method like you were able to before.

This pattern mostly still works after this refactor; you just have to check p2ppart.Empty() instead of p2ppart == nil.
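
A sketch of how the pattern might look with an Empty() check, using local stand-in types so it runs on its own. The Component fields, this SplitLast, and the splitPeerID helper are assumptions made for illustration; in particular, the string comparison on Protocol stands in for the library's Protocol().Code != ma.P_P2P check.

```go
package main

import "fmt"

// Component is a simplified stand-in; the zero value means "no component".
type Component struct {
	Protocol string
	Value    string
}

// Empty reports whether the component is the zero value.
func (c Component) Empty() bool { return c.Protocol == "" }

// Multiaddr is a concrete slice of components.
type Multiaddr []Component

// SplitLast returns everything but the last component, plus the last
// component. For an empty input it returns a nil Multiaddr and an empty
// Component instead of a nil pointer.
func SplitLast(m Multiaddr) (Multiaddr, Component) {
	if len(m) == 0 {
		return nil, Component{}
	}
	return m[:len(m)-1], m[len(m)-1]
}

// splitPeerID is a hypothetical helper following the pattern from the
// discussion: strip a trailing p2p component if one is present.
func splitPeerID(m Multiaddr) (Multiaddr, string) {
	transport, p2ppart := SplitLast(m)
	if p2ppart.Empty() || p2ppart.Protocol != "p2p" {
		return m, ""
	}
	return transport, p2ppart.Value
}

func main() {
	m := Multiaddr{
		{Protocol: "ip4", Value: "127.0.0.1"},
		{Protocol: "p2p", Value: "QmExamplePeer"},
	}
	transport, id := splitPeerID(m)
	fmt.Println(len(transport), id)
}
```

Because the second return value is a component rather than a Multiaddr, the caller can still inspect its protocol directly, which is the usage the reply above wants to preserve.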

MarcoPolo added 8 commits January 10, 2025 16:14

Remove Multiaddr interface

7bc8264

Refactor to make Multiaddr = []Component

eeaa191

update net package

a0b10fa

remove extra code

ef9995f

Add breaking text

0e45aea

remove panic

2b74265

add test for using nil multiaddr

4c69e16

Handle other error cases

4ab593a

2color mentioned this pull request Jan 13, 2025

[DISCUSSION] Exposing the underlying struct #100

Open

MarcoPolo added 8 commits January 14, 2025 14:01

skip empty components

c268d44

Remove ForEach usage

5c40c40

Encapsulate is the same as Join

fbef51e

Use nil as zero value for Multiaddr

3ca4833

Add matest package for multiaddr testing utilities

f9bebb2

remove ForEach usages

e32f70d

undo the deprecate ForEach until we have meg

3663997

explicit err check

293f2c0

MarcoPolo marked this pull request as ready for review January 21, 2025 18:45

MarcoPolo requested a review from sukunrt January 21, 2025 18:45

sukunrt reviewed Jan 27, 2025

nits

f43a278

MarcoPolo force-pushed the marco/multiaddr-refactor branch from c652d4a to f43a278 Compare January 29, 2025 19:56

MarcoPolo requested a review from sukunrt January 29, 2025 19:59
